Determining Sample Size: Balancing Power, Precision, and Practicality

Internet Resources for Determining Sample Size Organized by Chapter

Book Image

published by Oxford University Press

Sample size determination is an important and often difficult step in planning an empirical study. From a statistical perspective, sample size depends on the following factors: type of analysis to be performed, desired precision of estimates, kind and number of comparisons to be made, number of variables to be examined, and heterogeneity of the population to be sampled. Other important considerations include feasibility, such as ethical limitations on access to a population of interest and the availability of time and money. The primary assumption of this book is that, within the context of ethical and practical limitations, efforts to obtain samples of appropriate size and quality remain an important and viable component of social science research.

This text describes the following available approaches for estimating sample size in social work research and discusses their strengths and weaknesses: (1) power analysis, (2) confidence intervals, (3) computer-intensive strategies, and (4) ethical and cost considerations. In addition, strategies for mitigating pressures to increase sample size, such as emphasis on model parsimony (e.g., fewer dependent and independent variables), simpler study designs, an emphasis on replication, and careful planning of analyses are discussed. This text covers sample-size determination for advanced and emerging statistical strategies, such as structural equation modeling, multilevel analysis, repeated measures MANOVA, and repeated measures ANOVA which are not discussed in other texts including Cohen (1988). This book can serve either as a supplemental text for an introductory course or as a core text for a more advanced course.

Chapter 1

Random.Org (p. 4)


Chapter 2

G*Power (p. 13)

NCSS Statistical and Power Analysis Software (p. 13)

Difference Between Two Proportions-Power and Sample Size (p. 21)

ANOVA-Power and Sample Size (p. 24)

Logistic Regression with Binary Covariate(s)-Power and Sample Size (p. 31)

Logistic Regression with Exposure Measurement Error (p. 32)

Mx is a matrix algebra interpreter and numerical optimizer for structural equation modeling and other types of statistical modeling (p.34)

SAS Macro Programs: csmpower (p. 34)

NIESEM is a program for calculating user-defined confidence intervals around noncentral fit indices (p. 34)

PinT calculates approximate standard errors for estimates of fixed effect parameters in hierarchical linear models with two levels (p. 36)

Optimal Design Software-optimal design for longitudinal and multilevel research (p. 37)

Optimal Design Software Manual (p. 37)


Chapter 3

Compute CI of a sum, difference, quotient, or product (p. 43)

Confidence interval calculator for MS Excel (p. 43)

Noncentral confidence interval and power calculations (SPSS scripts) (p. 44)

Construct confidence interval for Cohen's d (SPSS script) (p. 44)

Effect size calculator (MS Excel) (p. 44)

This page contains two calculators. This page contains calculators for the normal distribution. These calculators can be used instead of a table of the normal distribution. (p. 44)

Calculator for the confidence interval for the difference between two independent proportions (p. 45)

WHATIS.EXE calculates confidence intervals for a variety of statistics, probability and inverse probability values, and time-spans. (p. 45)

Noncentral confidence interval and power calculations (SPSS scripts) (p. 46)

Calculate for confidence interval around an odds ratio (p. 46)

Calculate confidence intervals around chi square (p. 46)

Noncentral confidence interval and power calculations (SPSS scripts) (p. 48)

PSY calculate confidence intervals around contrasts (p. 49)

Statistica power analysis module (p. 49)

Calculate confidence intervals around rho (p. 49)

Calculate the 99%, 95%,and 90% confidence intervals for a regression coefficient (p. 51)

Calculate the 99%, 95%,and 90% confidence intervals for R-squared (p. 51)

Calculate an adjusted R-squared (p. 51)

R2 calculates confidence intervals for R-squared (p. 51)

R2 manual [pdf] (p. 51)

Statsoft (Statistica) (p. 54)


Chapter 4

DataSim generates data that conform to user specifications (p. 63)

Random Number Generator (p. 63)

Simstat is a program for data analysis and simulation (p. 64)

David Howell's Resampling Statistics page (randomization and the bootstrap) (p. 66)

The following syntax generates 2,000 cases with three normally distributed variables:

set seed = 200409281.
input program.
loop #i = 1 to 2000.
compute x_normal = rv.normal(50,15).
compute y_normal = rv.normal(25,7).
compute z_normal = rv.normal(5,2).
end case.
end loop.
end file.
end input program.
formats x_normal (f8.4).
execute.
descriptives variables = x_normal y_normal z_normal /statistics = all.

The SPSS syntax file for example on pp. 64-65 is as follows:

DEFINE repsamp ().
!DO !doover = 1 !TO 40.
USE ALL.
do if $casenum = 1.
compute #s_$_1=40.
compute #s_$_2=1000.
end if.
do if #s_$_2 > 0.
compute filter_$ = uniform(1)* #s_$_2 < #s_$_1.
compute #s_$_1 = #s_$_1—filter_$.
compute #s_$_2 = #s_$_2—1.
else.
compute filter_$ = 0.
end if.
VARIABLE LABEL filter_$ ’10 from the first 1000 cases (SAMPLE).’
FORMAT filter_$ (f1.0).
FILTER BY filter_$.
T-TEST
GROUPS = group(1 0)
  /MISSING = ANALYSIS
  /VARIABLES = posttest
  /CRITERIA = CI(.95) .
!DOEND.
!ENDDEFINE.
repsamp.
execute.

* Line 2 directs SPSS to draw 40 samples
*Line 5 directs SPSS to draw samples of N = 40
*Line 6 defines the number of cases in the sample as 1,000.



Chapter 5

There are no www resources listed in this chapter.


Chapter 6

Simstat is a program for data analysis and simulation (p. 78)

G*Power Users Manual (p. 78)

Compute CI of a sum, difference, quotient, or product (p. 80)

Difference between two proportions-Power and Sample Size (p. 82)

Calculator for the confidence interval for the difference between two independent proportions (p. 85)

Calculate for confidence interval around an odds ratio (p. 89)

Calculate confidence intervals around chi square (p. 89)

Calculate confidence intervals around rho (p. 111)

Calculate the 99%, 95%,and 90% confidence intervals for R-squared (p. 112)

Paul Dungeon's Webpage with NIESEM, which is a program for calculating user-defined confidence intervals around noncentral fit indices (p. 125)

Construct confidence interval for Cohen's d (SPSS script) (p. 132)

Calculator Student's t, given a probability and degrees of freedom (p. 132)


Appendix

David Powell's power analysis webpage (p. 138)

Various statistical programs (p. 144)

PinT - calculates approximate standard errors for estimates of fixed effect parameters in hierarchical linear models with two levels (p. 145)

Optimal Design Software - Optimal design for longitudinal and multilevel research (p. 145)

Optimal Design Software Manual (p. 145)

G*Power (p. 145)

Mx is a matrix algebra interpreter and numerical optimizer for structural equation modeling and other types of statistical modeling (p. 145)

PS is an interactive program for performing power and sample size calculations (p. 145)

Statistical Power Calculator (exe) (p. 145)

Power analysis for single mean, paired means, and independent means t-tests (SPSS syntax file) (p. 145)

Power analysis for one and two proportions (SPSS syntax file) (p. 45)

UnifyPow-Power analysis (SAS script) (p. 146)

How to determine the power or the sample size needed to achieve the specified power of a test of two proportions (SAS) (p. 146)

nQuery Advisor (p. 146)

Power and Precision software (p. 146)

Statistica Power Analysis Module (p. 146)

Zuma Stats is a standalone statistical program that can be integrated with the menu bars of SPSS and Excel (p. 146)

PASS-Power and Sample Size (p. 146)

Stata (p. 146)

SAS (p. 146)

SPSS (p. 146)

S-PLUS (p. 146)

Statistica (p. 146)

Calculator the effect size for multiple regression (f2), given a value of R2 (p. 146)

Calculate the minimum required sample size, given the alpha level, the number of predictors, the anticipated effect size, and the desired statistical power level for a multiple regression model (p. 146)

Calculate the beta level of a study, given the observed alpha level, the number of predictors, the observed R2, and the sample size for a multiple regression model (p. 147)

Calculate the confidence interval for relative risk (p. 147)

Logistic Regression with Binary Covariate(s)-Power and Sample Size (p. 147)

Logistic Regression with Exposure Measurement Error (p. 147)

VassarStats webpage-contains a variety of web-based statistical tools (p. 147)

Calculate sample size for a study comparing means (p. 147)

Web-based sample size calculator (p. 147)

Web-based sample size calculator for a variety of statistical tests (p. 147)

Web-base sample size calculators (p. 147)

http://home.ubalt.edu/ntsbarsh/Business-stat/otherapplets/SampleSize.htm#rproptyp (p. 147)

Researcher's web-based statistical toolkit (p. 148)

Java applets for power and sample size (p. 148)

SISA -Simple Interactive Statistical Analysis s a web-based tool to calculate the significance r, significance of the difference between two r coefficients, power of r, sample size for r, partial r, and multiple r (p. 148)


Additional World Wide Web Resources (Not Referenced in the Text)

Probability Distributions and Related Distributions

Calculator for confidence intervals for odds ratio unmatched case control study

Comparing Single Proportion to Known Population

Computer Software for Power Analysis in Covariance Structure Modeling

Distribution Density Calculators, Plotters and RNG's

Calculate sample size necessary to detect the difference between two proportions

Various web-based statistical tools

Free Statistics Calculators

Free Web Development Resources

Free Web Development Resources - Overview of the Web Color Model

GraphPad QuickCalcs Confidence interval of a SD

Interactive Statistical Calculation Pages

Interactive Statistical Calculation Pages2

Interactive Statistical Calculation Pages

JavaStat -- Retrospective Power Calculations

Kevin's Home Page

Martindale's Calculators On-Line Center

Martindale's Calculators On-Line Center Statistics - Courses

MathPages

MetaCalcPage

Noncentral Student CDF Calculator

Power Analysis for ANOVA Designs

Power Slates in HyperCourseware

Power and Sample Size Calculation for Logistic Regression with Binary Covariate(s)

Sample Size Calculations for Logistic Regression with Exposure Measurement Error

Sample size calculator

Sample Size Calculator by Raosoft, Inc.

Sample Size for Controlled Trials

Samplesize Home Page

Sample Size Calculator

Section of Medical Statistics

Statistical Applets

Statistical Calculators

The Ideas Behind Statistical Power

Java applet that demonstrates features of ANOVA

Victor Bissonnette's Home Page (useful statistical tables and more)

Data Generator is a free, open source script written in JavaScript, PHP and MySQL that lets you generatecustom data in a variety of formats.

Factorial Calculator

Calculate combinations and permutations

Sort Text

Sort Numbers

Sort References

Raynald's SPSS Tools contains 675 plus examples of syntax, macros, and scripts

Easy Calculation.com-Online Statistics Calculators for mean, median, mode, standard deviation, geometric mean, grouped data arithmetic mean, class interval arithmetic mean, root mean square, correlation coefficient, regression, harmonic mean.

last updated: 02/22/2008

back to top